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<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 1 

His Leu lie His Asn Val His Lys Glu Glu His Ala His Ala His Asn 
15 10 15 



<210> 2 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> 2 

His Asp Asp His Asp Asp His Asp Asp His Asp Asp His Asp Asp His 

1 5 ■ 10 15 

Asp Asp 



<210> 3 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 



1 



<400> 3 > 

His Glu Glu His Glu Glu His Glu Glu His Glu Glu His Glu Glu His 

1 ' 5 10 .15 

Glu Glu 



<210> 4 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400> '4 

His Asp Glu His Asp Glu His Glu Asn His Glu Asn His Glu Asp His 

15 10 15 

Glu Asp 



<210> 5 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<400>5 

His Glu Asp His Glu Asp His Glu Asp His Glu Asp His Glu Asp His 

1 • 5 10 15 

Glu Asp 



<210> 6 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> affinity peptide 
<4 00> 6 

Asp Asp Asp Asp Lys 
1 5 



<210> 7 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> enterokinase cleavage site 
<400> 7 



2 



He Glu Gly Arg 
* 1 . ■ 



<210> 8 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a factor Xa cleavage site 
<400> 8 

Leu Val Pro Arg Gly Ser 
1 5 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a thrombin cleavage site 
<400> 9 

His Pro Phe His Leu Val He His 
1 5 



<210> 10 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> a renin cleavage site 
<400> 10 

Cys Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
15 10 



<210> 11 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an immunological tag 
<400> 11 

Asp Tyr Lys Asp Asp* Asp Asp Lys 
1 5 



<210> 12 
<211> 11 
<212> PRT 

<213> Artificial Sequence 



3 



<220> . 

<223> an immunological tag 
<4'00> 12 

Cys Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 



<210> 13 
<211> 3426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 13 

gacgaaaggg 

cttagacgtc 

tctaaataca 

aatattgaaa 

ttgcggcatt 

ctgaagatca 

tccttgagag 

tatgtggcgc 

actattctca 

gcatgacagt 

acttacttct 

gggatcatgt 

acgagcgtga 

gcgaactact 

ttgcaggacc 

gagccggtga 

cccgtatcgt 

agatcgctga 

catatatact 

tcctttttga 

cagaccccgt 

gctgcttgca 

taccaactct 

ttctagtgta 

tcgctctgct 

ggttggactc 

cgtgcacaca 

agctatgaga 

gcagggtcgg 

atagtcctgt 

gggggcggag 

gctggccttt 

ttaccgcctt 

cagtgagcga 

cgattcatta 

acgcaattaa 

cggctcgtat 

accatgatta 

gctcatgccc 

ccttgggtcg 

agggattggc 



cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa 
atgcagctgg 
tgtgagttag 
gttgtgtgga 
cgccaagctt 
acaacaagat 
ttgctctgta 
tggtgtcggc 



cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgctc 
cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 
gaaggatcat 
cgatattgtc 
tttcgacgat 
cgcccactgc 



tataggttaa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttccaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 
ctcatccaca 
ggaggaagtg 
caacaggtct 
gtgtacggga 



tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtt'tattgct 
ggggccagat 
tatggatgaa 
actgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgataagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattctgtg 
aacgaccgag 
gcctctcccc 
gaaagcgggc 
ggctttacac 
tcacacagga 
atgtccacaa 
actccagaga 
gcggagcttc 
gaaatatgga 



ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagggt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 
agaggagcac 
aggagcctgg 
tctggtgagc 
gccgtctaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 



4 



tggaaagcag 
aggttgattg 
attgccatga 
ttaccagaag 
gcacttatat 
tcaaatgaga 
gcaggctatg 
tgccaagaaa 
ctgcctaatc 
tttctacatg 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
catatggtgc 
cccgccaaca 
acaagctgtg 
acgcgc 



tgctaggcct 
accaaattgt 
tgcatcttga 
aaaatcaagt 
atcaaggttc 
aatgtcaaca 
aagcaggagg 
acaacagatg 
gcccaggggt 
agctcgtaat 
cctggcgtta 
agcgaagagg 
cgcctgatgc 
actctcagta 
cccgctgacg 
accgtctccg 



gcatatggca 
cataaaccca 
aatgaaagtg 
ttttccccca 
tactgcagac 
acagatgcca 
ggtagattct 
gctcctggct 
gtatgcccgg 
tagctgagaa 
cccaacttaa 
cccgcaccga 
ggtattttct 
caatctgctc 
cgccctgacg 
ggagctgcat 



tcaaatctga 
cactacaata 
aactacacag 
ggaagaattt 
gtactgcaag 
gaatataaca 
tgtcaggggg 
ggcgtgacgt 
gtcccaaggt 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
ccttacgcat 
tgatgccgca 
ggcttgtctg 
gtgtcagagg 



cttctcctca 
aacggagaaa 
attatataca 
gttctattgc 
aagctgacgt 
ttacggaaaa 
attcaggcgg 
catttggata 
tcacagagtg 
gtcgttttac 
gcacatcccc 
caacagttgc 
ctgtgcggta 
tagttaagcc 
ctcccggcat 
ttttcaccgt 



gatagaaact 
gaacaatgac 
gcctatttgt 
tggctggggg 
tccccttcta 
tatggtgtgt 
accactcatg 
tcaatgtgca 
gatacaaagt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
tttcacaccg 
agccccgaca 
ccgcttacag 
catcaccgaa 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3426 



<210> 14 
<211> 269 * 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> protein sequence of vector containing cDNA of 
recombinant enterokinase 



<400> 14 



Met 


Thr 


Met 


He 


1 








His 


•Lys 


Glu 


Glu 








20 


Gly 


Ser 


Asp 


Ser 






35 




Phe 


Asp 


Asp 


Gin 




50 






Leu 


Val 


Ser 


Ala 


65 








Lys 


Trp 


Lys 


Ala 


Pro 


Gin 


He 


Glu 








100 


Tyr 


Asn 


Lys 


Arg 






115 




Met 


Lys 


Val 


Asn 




130 






Glu 


Asn 


Gin 


Val 


145 








Gly Ala 


Leu 


He 


Asp 


Val 


Pro 


Leu 








180 


Tyr 


Asn 


He 


Thr 






195 




Val 


Asp 


Ser 


Cys 




210 






Asn 


Asn 


Arg 


Trp 


225 








Ala 


Leu 


Pro 


Asn 



Thr Pro Ser Leu 
5 

His Ala His Ala 

Arg Glu Gly Ala 
40 

Gin Val Cys Gly 
55 

Ala His Cys Val 
70 

Val Leu Gly Leu 
85 

Thr Arg Leu He 

Arg Lys Asn Asn 
120 

Tyr Thr Asp Tyr 
135 

Phe Pro Pro Gly 
150 

Tyr Gin Gly Ser 
165 

Leu Ser Asn Glu 

Glu Asn Met Val 
200 

Gin Gly Asp Ser 
215 

Leu Leu Ala Gly 
230 

Arg Pro Gly Val 
245 



Lys Asp His Leu 
10 

His Asn Lys He 
25 

Trp Pro Trp Val 

Ala Ser Leu Val 
60 

Tyr Gly Arg Asn 
75 

His Met Ala Ser 
90 

Asp Gin He Val 
105 

Asp He Ala Met 

He Gin Pro He 
140 

Arg He Cys Ser 
155 

Thr Ala Asp Val 
170 

Lys Cys Gin Gin 
185 

Cys Ala Gly Tyr 

Gly Gly Pro Leu 
220 

Val Thr Ser Phe 
235 

Tyr Ala Arg Val 
250 



He His Asn Val 
15 

Asp He Val Gly 
30 

Val Ala Leu Tyr 
45 

Ser Arg Asp Trp 

Met Glu Pro Ser 
80 

Asn Leu Thr Ser 
95 

He Asn Pro His 
110 

Met His Leu Glu 
125 

Cys Leu Pro Glu 

He Ala Gly Trp 
160 

Leu Gin Glu Ala 
175 

Gin Met Pro Glu 
190 

Glu Ala Gly Gly 
205 

Met Cys Gin Glu 

Gly Tyr Gin Cys 
240 

Pro Arg Phe Thr 
255 



5 



Glu Trp lie Gin Ser Phe Leu His Glu Leu Val lie Ser 
260 265 



<210> 15 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> an amino acid sequence embodiment of the affinity 
purification site 

<400> 15 

His Asn His Asn His Asn His Asn His Asn His Asn 
15 10 



<210> 16 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 16 

catctcatcc acaatgtcca caaagaggag cacgctcatg cccacaac 48 

<210> 17 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 17 

cataaccata accataacca taaccataac cataac 36 

<210> 18 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 18 

catgatgatc atgatgatca tgatgatcat gatgatcatg atgatcatga tgat 54 

<210> 19 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 



6 



<223> a DNA sequence embodiment of. the affinity 
purification site 

<400> 19 

catgaggagc atgaggagca tgaggagcat gaggagcatg aggagcatga ggag 

<210> 20 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> a DNA sequence embodiment of the affinity 
purification site 

<400> 20 

catgatgagc atgatgagca tgagaaccat gagaaccatg aggatcatga ggat 



<210> 21 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> VARIANT 
<222> 9 

<223> Xaa at position 9 is an amino acid with an 
aliphatic or amide side chain. 



<221> VARIANT 
<222> 2 

<223> Xaa at position 2 is an amino acid with an 
aliphatic or amide side chain 

<221> VARIANT 
<222> 3 

<223> Xaa at position 3 is an amino acid with an 
aliphatic or amide side chain. 

<221> VARIANT 
<222> 5 

<223> Xaa at position 5 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 

<221> VARIANT 
<222> 6 

<223> Xaa at position 6 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 



<221> VARIANT 
<222> 7 

<223> Xaa at position 7 is an amino acid with a basic 
side chain (except HIS) or an acidic side chain. 



<400> 21 

His Xaa Xaa His Xaa Xaa Xaa His Xaa 
1 5 



<210> 22 



<2ll> 6 
<212> PRT 
<213> Human 

<220> 

<400> 22 

His Arg His Arg His Arg 
1 5 



<210> 23 
<211> 9 
<212> PRT 
<213> Human 

<220> 

<221> .VARIANT 
<222> 2 

<223> Xaa = an amino acid having an acidic side chain. 

<221> VARIANT 
<222> 3 

<223> Xaa = an amino acid having an acidic side chain. 

<221> VARIANT 
<222> 5 

<223> Xaa = an amino acid having an acidic side chain. 

<221> VARIANT 
<222> 6 

<223> Xaa = an amino acid having an acidic side chain. 

<221> VARIANT 
<222> 8 

<223> Xaa = an amino acid having an acidic side chain. 

<221> VARIANT 
<222> 9 

<223> Xaa = an amino acid having an acidic side chain. 
<400> 23 

His Xaa Xaa His Xaa Xaa His Xaa Xaa 
1 5 



<210> 24 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Protease cleavage site 
<400> 24 

Asp Asp Asp Asp Lys 
1 5 



<210> 25 



8 



<2ll> 21 
*<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 25 

Leu Pro Pro Leu Ser Glu Leu lie Pro Leu Ala Ala Ala Glu Arg Pro 

15 10 15 

Ser Ala Ala Ser Gin 
20 



<210> 26 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 26 

Ala Arg Lys Arg Lys Ser Ala Gin Tyr Ala Asn Arg Leu Ser Pro Arg 

15 10 15 

Val Gly Arg Phe lie Asn Ala Ala Gly Thr Thr Gly Phe Pro Thr Gly 

20 25 30 

Lys Arg Ala Val Ser Ala 
, 35 



<210> 27 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Translation of nucleotide coding sequence 
<400> 27 

Glu Phe Thr Gly Arg Arg Phe Thr Thr Ser 
1 5 10 



9 



